rank | frequency | n-gram |
---|---|---|
1 | 7287 | -e |
2 | 4463 | -n |
3 | 4256 | -r |
4 | 3961 | -t |
5 | 2667 | -s |
rank | frequency | n-gram |
---|---|---|
1 | 3637 | -er |
2 | 3458 | -en |
3 | 2103 | -et |
4 | 1572 | -ne |
5 | 1510 | -de |
rank | frequency | n-gram |
---|---|---|
1 | 985 | -rne |
2 | 682 | -ing |
3 | 640 | -ede |
4 | 586 | -sen |
5 | 580 | -ter |
rank | frequency | n-gram |
---|---|---|
1 | 968 | -erne |
2 | 379 | -ende |
3 | 367 | -ngen |
4 | 294 | -ning |
5 | 255 | -nger |
rank | frequency | n-gram |
---|---|---|
1 | 339 | -ingen |
2 | 231 | -inger |
3 | 148 | -elsen |
4 | 138 | -gerne |
5 | 136 | -erede |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings